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ABSTRACT 

Measurements of X-ray scaling laws are critical for improving cosmological constraints derived with 
the halo mass function and for understanding the physical processes that govern the heating and 
cooling of the intracluster medium. In this paper, we use a sample of 206 X-ray selected galaxy 
groups to investigate the scaling relation between X-ray luminosity (Lx) and halo mass (M200) where 
M200 is derived via stacked weak gravitational lensing. This work draws upon a broad array of multi- 
wavelength COSMOS observations including 1.64 degrees 2 of contiguous imaging with the Advanced 
Camera for Surveys (ACS) to a limiting magnitude of Ifsi4W = 26.5 and deep XMM-Newton/Chandra 
imaging to a limiting flux of 1.0 x 10~ 15 erg cm~ 2 s~ 4 in the 0.5-2 keV band. The combined depth 
of these two data-sets allows us to probe the lensing signals of X-ray detected structures at both 
higher redshifts and lower masses than previously explored. Weak lensing profiles and halo masses are 
derived for nine sub-samples, narrowly binned in luminosity and redshift. The COSMOS data alone 
are well fit by a power law, M200 oc (Lx) Q , with a slope of a — 0.66 ± 0.14. These results significantly 
extend the dynamic range for which the halo masses of X-ray selected structures have been measured 
with weak gravitational lensing. As a result, tight constraints are obtained for the slope of the M — Lx 
relation. The combination of our group data with previously published cluster data demonstrates that 
the M — Lx relation is well described by a single power law, a — 0.64 ±0.03, over two decades in mass, 
M 200 ~ 10 13 - 5 - 10 15 5 /i7 2 1 M o . These results are inconsistent at the 3.7<r level with the self-similar 
prediction of a = 0.75. We examine the redshift dependence of the M — Lx relation and find little 
evidence for evolution beyond the rate predicted by self-similarity from z ~ 0.25 to z ~ 0.8. 

Subject headings: cosmology: observations - gravitational lensing - large-scale structure of Universe 
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1. INTRODUCTION 

Groups and clusters of galaxies, formed through the 
gravitational collapse of massive dark matter halos, are 
now readil y identified up to redshift one and even be - 
yond (e.g. jstanford et al.|[2006| |Eisenhardt et al.||2008| ). 
Baryonic tracers such as red-sequence galaxies, typically 
abundant at the centers of groups and clusters, or X- 
ray emission from the hot intracluster medium (ICM), 
have proved to be especially successful in t his task (e.g. 
'Gladders fc~Yeel[2005l |Koester et aT] [20071 |Finoguenov] 
et al.||2007p . Nonetheless, these observables only trace 
the tip of the iceberg given that the vast majority of the 
underlying mass is in the form of dark matter. 

The quantification of the total mass (both dark and 
baryonic) of groups and clusters of galaxies is an im- 
portant endeavour from both a cosmological and an as- 
tronomical standpoint. In particular, several lines of 
research would benefit from a clearer understanding of 
the relationship between the total halo mass of groups 
and clusters and their baryonic tracers. We outline sev- 
eral briefly here (for a recent review on this subject see 
Voitp005i From a cosmological perspective, the num- 
ber density of groups and clusters as a function of total 
mass is of fundamental interest because it is sensitive to 
both the expansion and growth history of the universe 



such as fi m , a 8 , and Qa (e-fl- |White et al. 1993; Wang & 


Stcinhardt 1998 




tiaiman et al.| 12001; 


Kozo et al.||2004; 


Wang et al.|2004 


a 


Bahcall et al.|2004[ 


Rozo et al.|2009|). 


Furthermore, moc 


ifications to the laws of gravity which 



can be invoked as a possible physical mechanism for ac- 
celeration, could imprint telltale signatures in the abun- 
dance and d ark matter structure of groups and cluster s 



of galaxies ( Rapetti et aL]|2008j |Schmidt et al]|2008 l. 
Unfortunately, the common difficulty encountered with 
each of these enquires is that theories and simulations 
make dark matter based predictions but our most acces- 
sible observables (such as richness or X-ray luminosity) 
are baryonic in nature. 

It has long been recognized that baryonic observables 
are subject to complex and poorly understood physical 
processes that make them imperfect dark matter tracers. 
For example, X-ray studies discovered early on that the 
theory of pure gravitational collapse which makes simple 
predictions for the shape and amplitude of X-ray scaling 



relati ons (also known as the self-similar model, Kaiser 



19861, fails to match observations s uch as the slope and 



normalization of the Lx -T relation ( Voit 2005 and refer- 
ences therein) implying that other non-gravitational (and 
still much debated) processes have significantly affected 
the thermodynamic state of the ICM. Additional heat- 
ing and cooling mechanisms that are invoked to solve 
this puzzle lead to different predictions for the shape 
and redshift evolution of X-ray scaling relations. On the 
one hand, the fact that X-ray scaling relations deviate 
from simple models is a plague for cosmologists because 
there is no straightforward recipe to estimate total halo 
masses. On the other hand, from an astronomical per- 
spective, the comparison between X-ray observables and 
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total halo mass contains valuable clues about the physical 
processes that govern galaxy formation and the heating 
and cooling of the ICM. 

For all of these reasons, more precise measurements 
of the mean and scatter in the relationship between to- 
tal halo mass and various baryonic tracers of groups and 
clusters of galaxies are highly desirable (e.g. see discus- 



sions in 



Voit (2005) and Albrccht et al 



At present, there are five popular met 



(2006}) 

Tods for detect- 
ing groups and clusters o f galaxies: a) optical detection 
via the red -sequence (e.g. |Gladders fc Yee|[2005{ |Koester 



et al.|2007| , b) detection via the Sunyaev-Zeldovich (SZ) 



effect which measures the distort ion of the CMB spec- 



trum due to the hot ICM (e.g. Sunyaev fe; Zeldovich 



19701 

Staniszews 



U972 



Carlstrom et al.||2002| |Benson et al-H^UH 



/ich| 



d et al. | 2008[), c) detection via X-ray emis- 



sion (e.g. Bohringcr et al. 2000; Hasinger et al. | |2001 
Finoguenov et al. 2007; Vikhlinin et al. 2008), d) spectro- 
scopic identificatio n (e.g . Cerke et al.||2005| |Miller et al. 
20051 iKnobel e t al.|2009 ), and e) detection via weak lens- 



mg maps (e.g. Marian fc Bernstein||2006| |Miyazaki et al. 



2007 Massey et al.|2007 l. This last technique is the sim- 



plest in terms of the underlying physics and is the only 
method for which the total halo mass can be directly 
probed, independently of both the baryons and the dy- 
namical state of the cluster. However, shear maps can 
only detect the most massive systems (M > 10 14 M Q ) 
and are limited to moderate redshifts because the lensing 
weight function peaks mid-way between the source and 
the observer, with galaxy shapes increasingly difficult to 
measure at z > 1. X-ray observations on the other hand, 
can more simply probe complete samples of groups and 
clusters, but departures from virial equilibrium and non- 
thermal pressure components in the I CM can bias X-ray 
based hydrostatic mass estimates (e.g. Nagai et al.|2007 l. 
The SZ effect has the attractive property of being red- 
shift independent, and the integrated SZ flux increment, 
Y, ma y be less sensitive to the baryo n physics of cluster 
cores (Motl et al. 2005 Nagai 20061 but mass measure- 
ments with the ti'Z effect face other challenges suc h as the 
identification and removal of radio point-sources QVale fc| 
|White||2006|), sky confusion owing to projection effects 
" White et al.| [2002 ) , and possibly a larger scatter in the 
M relati on than previousl y estimated due to feedback 
processes ( Shaw et al.||2008 1. 

Given these considerations, a promising strategy is to 
employ a robust and efficient cluster detection method 
(to which the ultimate solution may be a combination of 
sever al techniques such as described in |Cohn fe^W hitc 



2009 1 and to perform an absolute mass calibration of 



baryonic tracers via weak gravitationa l lensing ( Hoekstra 
|2007a[ |Rykoff et al.||2008| |Berge et aF " 



2008) 



The focus of this paper is to advance these goals by cal- 
ibrating the slope and amplitude of the M — Lx relation 
for galaxy groups using cross-correlation weak lensing 
in the COSMOS survey (also called "group-galaxy lens- 
ing"). Extending weak lensing measurements into the 
group regime is particularly important in order to extend 
the dynamic range of weak-lensing-based mass-estimates 
so as to more accurately determine the slopes of scaling 
relations. Using the COSMOS sample, we show that X- 
ray detections span a more complete and wide range of 
redshift and mass than detections via shear maps. In- 
deed, high redshift and small structures are challenging 
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to detect directly with shear because of the shape of the 
lensing weight function (see Q. Nevertheless, once they 
have been identified, groups and clusters can be stud- 
ied via stacking techniques. A notable advantage of this 
method is that measurements are unaffected by uncorre- 
cted mass along the linc-of-sight whereas mass estimates 
for individually detected clusters are subject to ~ 20% 



uncertainties (Mctzlcr et al. 2001 


Hoekstra 2003 


de Put- 


ter & White 2005). The associated drawback wit 


ti stack- 



is difficult to recover. 

In order to employ the stacked weak lensing technique, 
tight baryonic tracers of halo mass are highly desirable. 
The X-ray luminosity of groups and clusters is consid- 
ered to be a reasonable tracer of halo mass with a log- 
arithmic scj^ttej^jnjyie^M — Lx relation of roughly 20% 
to 30% (|Stanek et al.|[2006] |Maughan|[2007l |Pratt et al. 



20081 [Rozoei al. | ij 00 8 1 |Ry kott et, al.|^U08||Vikhlinm et, a 

2009 ) . A large fraction of this scatter has been shown to 
be associated with the presence of cool-cores and simple 
excisio n techniques can reduce the sca tter to sub-20% 
levels ( |Maughan|2007]|Pratt et al.|2008[ ). Although more 
tightly correlated mass tra cers have been identifi ed such 
as the Yx parameter (e.g. Kravtsov et al. 2006) - such 
indicators require the measurement of an X-ray spectrum 
which is not possible for most survey data where count 
rates are low. Our choice of Lx as a mass proxy reflects 
that fact that it is a simple X-ray observable, accessible 
with survey quality data, and the only one that can be 
easily measured at high rcdshift. Temperature measure- 
ments may be feasible for a small fraction of high rcdshift 
objects but cosmological studies that require complete 
samples of high rcdshift groups and clusters will need 
simple mass proxies like Lx- The details of the M — Lx 
relation are also important (regardless of the choice of a 
mass proxy) for determining effective volu mes as a func- 
tion of mass in X-ray flux l imited surveys ( Stanek et al 
2006] |Vikhlinin et al.|[2009| ). 



The layout of this paper is as follows. The data are 
presented in fpjand the theoretical lensing background is 
developed in ^3] The construction of the group catalog 
and the lens selection arc specified in fj4] Details regard- 
ing the adopted form of the M — Lx relation are given 
in f|5] Our main results are presented in f|6] followed by 
our assessment of the systematic errors in |7[ Finally, we 
discuss the results and draw up our conclusions in EjSj 

We assume a WMAP5 ACDM cosmology with Q m = 
0.258, fl A = 0.742, fl b h 2 = 0.02273 n a = 0.963, 
cr 8 = 0-796, H = 72 h 72 km s" 1 Mpc" 1 (|Hinshaw et al 



20091). All distances are expressed in physical units of 
hj 2 Mpc. X-ray luminosities are expressed in the 0.1- 
2.4 keV band, rest-frame. The letter M denotes halo 
mass in general whereas M200 is explicitly defined as 
M200 = M(< r 20 o) = 200/9 cr i t (z)|7rr| 00 where r 20 o is 
the radius at which the mean interior density is equal to 
200 times the critical density (p cr it = 3if 2 (z) /8irG). The 
function E(z) = H(z)/Hq = ^/f2 m (l + z) 3 + f^A repre- 
sents the Hubble parameter evolution for a flat metric. 
All magnitudes are given on the AB system. 

2. THE COSMOS SURVEY 

The COSMOS survey brings together a broad array 
of panchromatic observations with imaging data from X- 



ray to radio wavelengths and a large spectr oscopic follow- 
up program (zCOSMOS) with the VLT dScoville et al. 



2007| |Koekemoer et aL]|2007| |Lilly et al.||2007] . In par 

ticulax, the COSMOS program has imaged the largest 
contiguous area (1.64 degrees 2 ) to date with the Hub- 
ble Space Telescope (HST) using the Advanced Camera 
for Surveys (ACS) Wide Field Channel (WFC). In addi- 
tion to the ACS/WFC (I F8 i4w) imaging, the COSMOS 
field has been targeted by both the XMM-New ton (1.5 
Ms, |Hasinger et al.|2007| [Cappelluti et al.||2009D a nd the 
Chandra observatories (1.8 Ms, pvis et al.||20D9| . The 
combination of ACS imaging to provide accurate shape 
measurements, and of XMM-Newton /Chandra imaging, 
sets the stage for the study of the dark matter halos of 
galaxy groups via weak lensing techniques. In the fol- 
lowing sections, we describe the various data-sets and 
catalogs employed in this analysis. 

2.1. ACS Lensing data 

Our general scheme for the construc tion of the COS- 
MOS ACS lensing catalog is based on Leauthaud et al. 
(20071 and we refer the reader to this publication for 
details regarding the source extraction and catalog con- 
struction - only a brief review will be given here. Since 



Leauthaud et al. (20071, however, we have made a key 
improvement regarding the effects of Charge Transfer In- 
efficiency (CTI) in the ACS CCDs. This aspect in par- 
ticular is outlined belo w in g reater detail. 



(2007} and jRhodes et al.| p007| 

JOSlVlOS ACS images are strongh 



In Leauthaud et al 
we remarked that the COSMOS ACS images are strongly 
affected by CTI. As charge is transferred during the CCD 
read-out process, a certain fraction is retained by charge 
traps (created by cosmic ray hits) in the pixels. This 
causes flux to be trailed behind objects as the traps grad- 
ually release their charge, spuriously elongating them in 
a coherent direction that mimics a lensing signal. Our 
previous work employed a parametric scheme to correct 
for this effect at a catalog level. Although a paramet- 
ric scheme provides a first order correction of CTI, it 
neglects the dependence of the CTI on object size, ra- 
dial profile, and sha pe for example. For this reason, in 
Massey et al. ( 2009), we have developed a physically mo- 
tivated correction scheme that operates on the raw data 
and that has been shown to achieve a 97% level of correc- 
tion. Using this scheme, we have produced a new set of 
raw "unrotated" ACS/WFC data (version 2.0) in which 
the CTI trailing is reduced by m ore than an order o f 
magnitude (for further details, see Massey et al.||2009] ). 
The raw data are co-added using the same MultiDrizzle 
set-up as in our previous work. 
We use Version 2.5.0 of the SE xtractor photometry 



package (Bertin & Arnouts 19961 to extract a source 
catalog of positions from the y2.0 ACS data us i ng th e 
same "Hot-Cold" method as in Leauthaud et al. (20071. 
Defects and diffraction spikes are carefully removed from 
the catalog, leaving a total of ~ 1.2 x 10 6 objects to a 
limiting magnitude of Ifsi4W = 26.5. 

The next step is to measure the shapes of galaxies and 
to correct them for the distortion induced by the tim e 
varying ACS PS F as described i n [Rhodes et ~ aT1 (|2007|. 

~ P007| ) 



As compared to Rhodes et al 



the parametric 

CTI correction is no longer applied because this effect 
has already been removed in the raw images. 
Simulated images are used to derive the shear suscep- 
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tibility factors that are necessary in order to transform 
shape measurements into unbiased shear estimators. Fi- 
nally, for every galaxy we derive a shape measurement er- 
ror and utilize this quantity to extract the intrinsic shape 
noise of the galaxy sample. Representing a number den- 
sity of 66 galaxies/arcminute 2 , the final COSMOS weak 
lensing catalog contains 3.9 x 10 5 galaxies with accurate 
shape measurements. 

2.2. Photometric and Spectroscopic Redshifts 

Redshift information is critical for both the lens and 
source populations because it allows one to correctly 
scale lensing relations, to remove foreground contamina- 
tion, and to study weak lensing signals in terms of physi- 
cal instead of angular distances. We use an updated and 
improved version of t he photome t ric re dshifts (hereafter 
photozs) presented in Ilbert et al. (2009 1 which have been 
computed with over 30 bands ot multi- wavelength d ata. 
The main differences between the |Ilbert et al. (2009) cat- 
alog and the one that we use here is the addition or deep 
H band data and small improvements in the template 
fitting technique. Details r egarding the data and the 
photometry can be found in Capak et al. (20071. 

Photozs were estimated using a x z template fitting 
method (Lc Phare) and compared with large spectro- 
scopic samples from the Very Large Telesc ope (VLT) 
Visible Multi-Object Spectrograph (VIMOS) ( [Lilly et al. 
20071 and the Keck Deep Extragalactic Imaging Multi- 
Object Spectrograph (DEIMOS). The combined spec- 
troscopic redshift sample comprises: 10801 galaxies at 
z ~ 0.48, 696 at z ~ 0.74, and 870 at z ~ 2.2. The 
dispersion in the photozs as measured by comparing 
to the spectroscopic sample is OA z /( 1+zspec ) = 0.007 at 



AB 



< 22.5 where Az 



Zphot- The deep IR and 



Infrared Array Camera (IRAC) data enables the photozs 
to be calculated even at fainter magnitudes with a rea- 
sonable accuracy of cr^ 2 /( 1+zspoc ) = 0.06 at i\ B ~ 24. In 
particular, deep J, H , K, and u* band data allow for a 
better estimation of the photozs at z > 1 via the 4000A 
break which is shifted into the infrared (IR). 

Larger samples of spectroscopic redshifts at z > 1 
will ultimately be required to define the most trustwor- 
thy magnitude and redshift range for the source gala x- 
ies (in a similar fashion to |Mandelbaum et al. p008)). 
Meanwhile, to mitigate the effects of photoz uncertam- 
ties, we use a conservative source galaxy selection which 
will be discussed in more detail in <|7j In short, we re- 
ject all source galaxies with a secondary peak in the 
redshift probability distribution fu nction (i.e. the pa- 



rameter ZP_SEC is non zero in the Ilbert et al. ( 2009 1 
catalog). This cut is aimed to reduce the number of 
catastrophic errors (a preferential shift in a certain pop- 
ulation of galaxies from one redshift bin to another) in 
the source catalog. The ZP_SEC> galaxy population is 
expected to contain a l arge fraction of catastrop hic er- 
rors (roughly 40%-50%, (ffierTeF^][2006| [2(XK|. The 
photoz quality cuts reduce the number density of source 
galaxies from 66 to 34 galaxies/arcminute 2 . The fi- 
nal mean redshift and magnitude of the source sample 
is (z) ~ 1 and (Ifsi4w) ~ 24. In addition to these 
quality cuts, for each lens-source pair, we demand that 



-z\ c 



> °68% (^source) and that z s , 



2.3. Stellar Mass Estimates 

Stellar masses are used to iden tify the Most Mas- 
sive Central Galaxy (MMCG) (see j |4.4| for more details) 
and are estimated using the Bayesian code described in 
Bundy (2006a) and Bundy et al. (2006b). Briefly, an ob- 
served galaxy's spectral energy distribution (SED) and 
pho toz is referenced to a grid o f models constructed using 
the Bruzual & Chariot (20031 synthesis code. The grid 
includes models that vary in age, star formation history, 
dust content, and metallicity. At each grid point, the 
probability that the observed SED fits the model is cal- 
culated, and the corresponding stellar-mass to K-band 
luminosity ratio and stellar mass is stored. By marginal- 
izing over all parameters in the grid, the stellar mass 
probability distribution is obtained. The median of this 
distribution is taken as the stellar mass estimate, and the 
width encodes the uncertainty due to degeneracies and 
uncertainties in the model parameter space. The final 
uncertainty on the stellar mass also includes the K-band 
photometry uncertainty as well as the expected error on 
the luminosity distance that results from the photoz un- 
certainty. The typical final uncertainty is 0.2-0.3 dex. 

2.4. XMM and Chandra data 

The entire COSMOS region has been mapped through 
54 overlapping XMM-Newton pointings and additional 
Chandra observations cover the central region (0.9 
degrees 2 ) to higher resolution. A composite XMM- 
Newton and Chandra mosaic has been used to detect 
and measure the fluxes of groups and clusters to a 4er 
detection limit 1 of 1.0 x 10 -15 erg cm" 2 s~ 4 over 96% of 
the ACS field. The general data re duction process can be 
found in Finoguenov et al. (20071 and details regarding 
improvements and modifications to the original catalog 
are given in fj4] In particular, the group and cluster 
catalog used in this paper features a more conservative 
point- source removal procedure than in|Finogucno v~et al.| 
(20071. Redshift identification has also improved thanks 
to the increased photoz accuracy and to the availability 
of more spectroscopic data (see S4.3I. In total, the cat- 
alog used in this paper contains 206 X-ray groups and 
clusters of galaxies over 1.64 degrees 2 , spanning the red- 
shift range < z < 1.6 and with a rest-frame 0.1-2.4 
keV luminosity range between 10 41 and 10 44 erg s _1 . 

3. THEORETICAL LENSING FRAMEWORK 

3.1. From galaxy shapes to AE 

In the weak gravitational lensing limit, the observed 
shape £ b s of a source galaxy is directly related to the 
lensing induced shear 7 according to 



£obs — £int + 7; 



(1) 



-Zl c 



> 0.1 



to ensure a clean selection of background galaxies. 



where £i n t is the source galaxy's intrinsic shape that 
would be observed in the absence of gravitational lensing. 
In our notation, £j nt , £ bs, and 7 are spin-2 tensors. The 
above relationship indicates that galaxies would be ideal 
tracers of the distortions caused by gravitational lens- 
ing if the intrinsic shape e- mt of each source galaxy was 
known a priori. However, lensing measurements exhibit 

1 Quoted detecti on limits correspond to t he wavelet scale-wise 
reconstruction. See |Finoguenov et al.| ( [2007^ for more details. 
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an intrinsic limitation, encoded in the width of the ellip- 
ticity distribution of the galaxy population, noted here as 
(Tint , and often referred to as the "intrinsic shape noise" . 
Because the intrinsic shape noise (of order a- m t ~ 0.27, 



Leauthaud et al.|2007| ) is significantly larger than 7 (typ- 
ically 7 ~ U.U5 tor this work), shears must be estimated 
by averaging over a large number of source galaxies. 

Throughout this paper, the gravitational shear is noted 
as 7 whereas 7 represents our estimator of 7. The un- 
certainty in the shear estimator is a combination of un- 
avoidable intrinsic shape noise, a mt — \J ( e ? n t)> anc ^ °^ 
shape measurement error, <7 meas : 



4 



nt + ^mcas- (2) 

We will refer to 07 as "shape noise" whereas c^t will 
be called the "intrinsic shape noise" . The former in- 
cludes the shape measurement error and will vary ac- 
cording to each specific data-set and shape measurement 
method. Averaged over the whole COSMOS field, the 
weak lensing distortions represent a negligible perturba- 
tion to Equation [2j The intrinsic shape noise and mea- 
surement error for COSMOS have been characterized in 



Leauthaud et al.| ( |2007[ ) by using a sample of 27000 source 
galaxies that lie within the overlapping regions of adja- 
cent pointings. The shape measurement error is deter- 
mined for every source galaxy as a function of size and 
magnitude. For this paper, an intrinsic shape noise of 
fint = 0.27 is assumed. 

The derivation of our shear estimator is presented in 
|Leauthaud et al.| (|2007|. We employ the RRG method 

for further details) for galaxy 



(see iRhodes et al 



(2007). 



shape measurements! Briefly, we form 7 from the PSF 
corrected ellipticity according to 



7 = Cx 



^obs 



(3) 



where the shear susceptibility factor 2 , G, is measured 
from moments of the global distribution of e b s and 
ot her, higher order sha pe parameters (see equation 28 

Using a set of simulated im- 
Shear TEsting Pro gram (STEP; 



Rhodes et al. 2000 



ages similar to those o 



Heymans et al. 2006 



Massey et all 20061) but tailored 



exclusively to this data-set, we tind that, in order to cor- 
rectly measure the input shear on COSMOS-like images, 
the RRG method requi res an overall calibration factor of 

C = (0.86™)- 



(see 



Leauthaud et al. 



2007 



for more 



details). 

The shear signal induced by a given foreground mass 
distribution on a background source galaxy will depend 
on the transverse proper distance between the lens and 
the source and on the redshift configuration of the lens- 
source system. A lens with a projected surface mass 
density, S(r), will create a shear that is proportional to 
the surface mass density contrast, A£(r): 



AE(r) = £(< r) - E(r) = E crit x lt(r). 



(4) 



Here, E(< r) is the mean surface density within proper 
radius r, E(r) is the azimuthally averaged surface den- 
sity at radius r (e.g. Miralda-Escude|[l991 Wilson et al. 



2 Not to be confused with Newton's constant which we have 
noted Gn- 



2001 1, and jt is the tangentially projected shear. The 



geometry of the lens-source system intervenes through 
the critical surface mass density, £ cr it, which depends on 
the angular diameter distances to the lens (Dol), to the 
source (-Dqs)j and between the lens and source (-D LS ): 



Enrit, — 



Dos 



4ttG n D lD ls 



(5) 



where Gn represents Newton's constant. Hence, if red- 
shift information is available for every lens-source pair, 
each estimate of 7t can be directly converted to an esti- 
mate of AE which is a more desirable quantity because 
it depends only on the mass distribution of the lens. 

To measure AE(r) with high signal-to-noise, the lens- 
ing signal must be stacked over many foreground lenses 
and background sources. For every zth lens and jth 
source separated by a proper distance ry, an estimator 
of the mean excess projected surface mass density at r.^ 
is computed according to: 



A£jj(r,-j) 



Tt,ij X Ecrit.zji 



(6) 



where jt,ij is the tangential shear of the source rela- 
tive to the lens. The COSMOS photometric redshifts 



described in {2.2 are used to estimate £ cr it jj for every 
lens-source pair. In order to optimize the signal-to-noise, 
an inverse variance weighting scheme is employed when 
AE^ is summed over many lens-source pairs. Each lens- 
source pair is attributed a weight that is equal to the 
estimated variance of the measurement: 



(7) 



In this manner, faint small galaxies which have large 
measurement errors are down-weighted with respect to 
sources that have well measured shapes. 

In general, for the types of lenses studied in this paper 
(groups and low mass clusters of galaxies), the signal- 
to-noise per lens is not high enough to measure AE on 
an object by object basis so instead we stack the signal 
over many lenses. For a given sample of lenses, the total 
excess projected surface mass density is the weighted sum 
over all lens-source pairs: 



AE 



Efir El s r~ ^><7t« xe 



crit,ij 



7=1 Z^i=l 



N S a 



(8) 



3.2. Non- Weak Shear 



Equation [T] and subsequent derivations only hold in 
the weak gravitational lensing limit, that is to say when 
7 <C 1 and k <ti I (k — £/£ cr j t is the convergence). 
In reality, galaxy shapes trace the reduced shear, g = 
7/(1 — k). The masses of the groups that we are probing 
are such that the weak lensing assumption can begin to 
break down in the most inner bins (r < lOOkpc h^) and 
for high redshift source galaxies. In this regime, g ~ 7 is 
no longer a valid assumptio n. Followi ng the methodology 
outlined in Johnston et al. ( 2007 1 and Mandelbaum et al. 
2006a), it can be shown that our weighted estimator for 
£ will have a second-order contribution: 



G 
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AS = AS + AS E L 7 



^-3 < 



(9) 



(10) 



For further details, see E quation 19 in Johnston et al.| 
( 2007[ ) and Appendix A in Mandclb aum et al | ( j2006aP 
In a similar fashion to Johnston et al. T2007| , we ignore 
the variations of Lz within various radial bins. However, 
because our lens sample spans a large redshift range, we 
do not use a constant value of Lz for all redshift bins. 
Instead, Lz is calculated from the data for each redshift 
bin. Our values for Lz are given in Table 1. 

3.3. A Model for Predicting AS 

A halo model approach is used to model the surface 
mass densit y contrast AS as a function of transverse sep- 
aratio n (e.g. | Man delbaum et al. | 2005 |2006a|b Yoo et al. 
|2006| |Johnstoh et al.|2007[ ). The total signal is modelled 
as the sum of two distinct components that dominate 
the signal at different scales. The first term is due to the 
baryonic mass contained within the central galaxy and 
only contributes at scales below ~ 50kpc. The second 
term dominates the signal on intermediate to large scales 
(~ 50kpc to a few Mpc) and represents the group-scale 
dark matter halo (also known as the "one-halo term"). 
On the largest scales (above several Mpc) , the clustering 
of halos among themselves produces a contribution to 
AS via the so-called "two-halo term". However, we have 
found that the two-halo term is mostly sub-dominant at 
the scales that we probe. We have tested that the exclu- 
sion of the two-halo term has no impact on the results of 
this study and we therefore neglect this term hereafter. 

The baryonic mass of the central galaxies can have a 
non negligible contribution to AS at small transverse 
separations from the center of the stacked ensemble. Al- 
though the baryons typically follow a Sersic profile, at 
the scales of interest for this study, well above a few ef- 
fective radii (>40kpc), the lensing contribution of the 
baryons can be modeled by a simple point-source, scaled 
to (Mcg) ; the aver age stellar mass of the central galaxies 
(also see { 23] and j|4.4| : 



AS 



stellar 



(r) 



{Mi 



CG) 



(11) 



To be more precise, the baryons that have not yet 
transformed into stars should also be considered. How- 
ever, the majority of non-stellar group baryons are in 
the form of diffuse hot gas spread throughout the halo, 
in rough equilibrium with the dark matter potential. To 
first order, the gas mass contribution should follow the 
dark matter distribution. 

We assume that the de nsity profiles of dark m atter ha- 



los follow nfw profiles (Navarro et al. 
work, halo mass is defined as M200 = 



1997) 



In this 
< ^200) = 



200p cr i t (z) 37^200 and C200 denotes halo concentration. 
Numerous studies have demonstrated that the mass, con- 
centration, and characteristic formation epoch of dark 
matter halos are closely linked and on av erage, smaller 
halos tend to have higher concentrations (Bullock et al. 



2001 Wechsler et al. 2002; Maccio et al. 2007; Zhao et al 

I I I I I - I I I 



2008| ) . For this study, we adopt the Zhao et al.| ( |2008| 3 
mass-concentration relation for a WMA.P5 cosmology. 
By adopting this relation, AE n f w is fully described by 
two parameters, namely M200 and redshift. Analytical 
formulas fo r the AS corresponding t o a NF W profile can 
be fou nd in Wright fc Brainerd] ( |2000[ ) and in |Bartelmann| 
p96| . 

The additional gravitational potential due to the 
baryons is expected to modify the density profiles of dark 
matter halos via adiabatic contraction (Gnedin 2004 



2005, Sellwood and Mc Gaugh 2005). Nevertheless, [Man- 
delbaum et al.| ( |2006a[ ) have shown that this has a neg- 



ligible effect on the lensing signal on the scales that we 
consider (above 40kpc) and we neglect this effect for this 
work. 

The final model that we use for the weighted estimate 
AS is: 



AS = AS st 

cllar H~ A^nfw + AS n f w S n f w -^Z ■ 



(12) 



We have included the second order contribution to AS 
from non-weak shear. Note that only the dark matter 
halo contributes to this term because S sto n ar is zero at 
the scales that we probe. 

4. X-RAY GROUP SELECTION 

4.1. X-ray Selection Versus Shear Maps 

Among group and cluster probes, X-rays are perhaps 
the cleanest and the most complete selection method. 
Firstly, X-ray emission depends on the square of the gas 
density and so X-rays pick up the cores of dense struc- 
tures more accurately than SZ and are less prone to pro- 
jection effects. Secondly, unlike optical techniques which 
rely on galaxy properties, X-rays yield a complete sam- 
ple of groups and clusters, irrespective of their galaxy 
content. Finally, X-rays probe a wider range in mass 
and redshift than shear maps which are fundamentally 
limited by the shape of the lensing weight function. To 
illustrate the magnitude of this effect, in the upper panel 
of Figure [l] we show the expected lensing detection sig- 
nificance of X-ray structures in COSMOS as a function 
of mass and redshift. The lower panel in Figure [T] shows 
the comoving volume probed by the survey per unit red- 
shift. The theoretical lensing detection signifi cance is de- 
rived a ccording to the method outlined in Hamana et al. 
(120041) assuming a smoothed COSMOS redshift distribu- 



tion and isolated NFW profiles truncated at the virial 
radius. A source density of 66 galaxies/arcmin 2 and an 
average shape noise of tr* — 0.31 is assumed. Lensing 
S/N curves are based on fixed-scale Gaussian smoothing 
with a one arcminute smoothing kernel (an optimal filter 
would pick up slightly more signal) . Figure [T] demon- 
strates that lensing alone cannot detect low mass or high 
redshift objects. Instead one must resort to other de- 
tection techniques such as X-rays. Note that although 
the low redshift lensing sensitivity is relatively good, the 
volume probed is also quite limited. It is also important 
to note that the depth of the COSMOS data will proba- 
bly exceed any near- future space-based mission 4 . In this 
respect, Figure [T] can be considered as a realistic upper 

3 see http://www.shao.ac.cn/dhzhao/mandc.html 

4 The fiducial depth of EUCLID and J DEM is 1 ~ 25.5. 
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limit for lensing-based structure detection in the near- 
future (for both ground and space-based observatories). 

4.2. Construction of the Group Catalog 

The group catalog used for this w ork is an improved 
version of the catalog presented in |Finoguenov et ah] 
( 2007[| (hereafter F07) obtained by using additional 
XMM-Newton and Chandra data and by applying a new 
procedure for the removal of point-sources (Finoguenov 
et al., in prep). The group catalog contains a total of 206 
systems in the COSMOS ACS field, 170 of which are at 
z < 1. 

The group catalog is constructed from both the XMM- 
Newton and the Chandra mosaics. All features with spa- 
tial extents below 16" are removed before co-adding the 
two mosaics. The combined mosaic is used to search 
for sources with spatial extents on both 32" and 64" 
scales usi ng the wavelet decomp osition technique de- 

Both Chandra and 
resolution that is better 



10 42 /i 72 2 ergs 1 and so only concerns a few of the sys- 
tems considered here. For the Lx — T relation, we have 
used kT/keV = 0.2+6 x io( 1 °g( i x£( z )- 1 )-44.45)/2.i) which 



(1998) 
ial 



scribed in Vikhlinin et al. 
XMM-Newton have a spa 
than 30". The Full Width Half Maximum (FWHM) of 
the PSF is approximately equal to 16" for XMM-Newton, 
and varies between 3" and 8" for Chandra. 

Total X-ray fluxes are obtained from the measured 
fluxes by assuming a beta profile and by removing the 
flux that is due to embedded AGN point-sources. Previ- 
ous surveys have often assumed that all of the X-ray flux 
within a n extended source area is due to cluster emis- 
sion (e.g. Bohringcr et al. 2004). However, this assump- 
tion breaks down for deep surveys such as COSMOS. 
In particular we have calculated that on average, AGN 
contribute up to 30% of the extended X-ray emission 
in the 0.5-2 keV band, while for ROSAT All Sky Sur- 
vey (RASS) clusters, the estimated value is less than 
2%. Point-source removal is thus necessary for COS- 
MOS. However, the procedure that removes the flux from 
point-sources also removes flux from cool-cores and as a 
consequence, the total flux is underestimated. For com- 
parison with other work, it is important to correct for 
this accidental removal of the flux from cool-cores. There 
have been cla ims that the cool-core fraction is evolving 
with redshift (IJeltema et al.||2005| | Vikhlinin et al.||2007 
Maughan||2007p. We have therefore developed a methoc 



to correct for this effect directly from the data by using 
the high-resolution Chandra observations of the COS- 
MOS field which cover a contiguous are a of 0.5 degrees 2 
with the best PSF (3", |Elvis et al.|2009|, sufficient to dis- 
tinguish between cool-cores and AGIN . Taking advantage 
of these data, we compute a cool-core correction factor 
(noted fee) using the following procedure. To begin 
with, groups inside the high resolution Chandra area are 
binned into t he sa me nine bins as used for the lensing 
analysis (see {6.1). Next, wavelet scales below 4" are 
used to remove point-sources. Finally, the Chandra flux 
is stacked in each of the nine bins and a 16" aperture is 
used to estimate the average cool-core flux. The results 
are listed in Table 1 and range from a 3% to a 17% flux 
correction. 

Rest-frame luminosities are calculated from the to- 
tal flux following £o.i-2.4kcV = ^d 2 L K(z,T)C l 3(z,T)F d , 
where K(z,T) is the K-correction and Cp(z,T) is 
an iterative correction factor (see F07). The uncer- 
tainty in K(z, T) affects groups with luminosities below 



introd uces a break at group scales (as discussed in Voit 
2005| ) but which reproduces the Markevitch (2002) result 
at cluster scales. Recent work by Pratt et al. (2009) have 
derived a similar slope for the Lx — T relation for clus- 
ters as Markevitch (2002). The behaviour of the L x - T 
relation is not very well established at low temperatures, 
however exploring the effects of a different Lx — T re- 
lations is beyond the scope of this paper and is left for 
future work. 

Quality flags (named hereafter "xflag") are derived 
for the entire group catalog based on visual inspection 5 . 
xflag=1 is assigned to groups with a single optical 
counterpart and with a clear X-ray peak. These are 
mostly high-significance X-ray detections (over 6cr) in 
zones free of projection effects. The flag XFLAG=2 is as- 
signed to systems for which the extended X-ray emission 
is subject to projection effects but for which the vari- 
ous projections can be disentangled. Systems with ques- 
tionable optical/IR counterparts are assigned xflag=3. 
These arc primarily high-z (z > 1) candidates that are 
not considered in this work. xflag=4 indicates that 
there are several equally possible optical counterparts 
and that the X-ray flux cannot be disentangled for pro- 
jection effects. xflag=5 is assigned to systems for which 
the optical counterpart is uncertain and xflag =6 to 
identified extended emission not associated with galaxy 
groups (these are mainly interacting galaxies and X-ray 
jets, Smolcic et al., in prep.). xflag=7 is assigned to 
unidentified emission and xflag=8 assigned to systems 
located in the masked-out regions (edges of the survey 
and regions near bright stars). In this work, we only 
consider systems with xflag=1 or xflag=2. 

To ensure a high quality and robust lens catalog, in 
addition to the X-ray quality flags, all systems have been 
visually inspected and flagged for proximity to the edge 
of the ACS field, and contamination by the presence of a 
bright foreground star which will affect both the lensing 
measurements and the determination of the central group 
galaxy. All systems with such flags were removed from 
the lens catalog. In total, after all quality cuts, the lens 
catalog contains 127 systems at z < 1. None of these 
quality cuts are expected to bias the remaining sample 
but will improve the estimations of both Lx and M200 ■ 

4.3. Redshift Determination 

The optical counterparts of X-ray sources are identi- 
fied using a sophisticated red-sequence method. The de- 
tails of this technique are presented in Finoguenov et 
al. (2009) - only a brief outline is given here. Along 
the line-of-sight of each X-ray source, the redshift range 
< z < 2.5 is probed for potential red-sequence over- 
densities. For a given redshift z — zrs with < zrs < 
2.5, galaxies are selected within an aperture of 0.5Mpc 
(physical) from the center of the X-ray emission such 
that jzphot — £rs| < 0.2. The apparent size of the aper- 
ture is defined in terms of a physical scale and will vary 
with ZRg. Weights are derived for all galaxies according 
to their proximity to the center of the X-ray emission 

5 Visual inspection is performed on the XMM-Newton, ACS and 
Subaru data simultaneously. 
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Fig. 1. — Upper panel: theoretically computed lensing detection significance (dashed curves) of X-ray structures (data-points) in the 
COSMOS field as a function of mass and redshift. Lensing detection significance values have been derived with the method outlined in 



Hamana et al. 



oothed COSMOS redshift distribution and a source density of 66 galaxies/arcminute 2 . The predictions 
m tms Figure represent a realistic upper limit for all near-future weak-lensing surveys, including space-based missions such as JDEM and 
EUCLID. Indeed, the survey depth of these missions is unlikely to exceed the current depth of COSMOS. Direct structure detection via 
shear maps is limited by the lensing weight function at high redshifts and at low masses. To identify such structures, one must resort 
to other detection techniques. Lower panel: comoving volume probed by the (1.64 degrees 2 ) survey per unit redshift. Although the low 
redshift lensing sensitivity is relatively good, the volume probed is also quite limited. 



and to the comparison of both their color and magni- 
tude to a model red-sequence at zrs- The red-sequence 
detection significance is determined by applying the same 
procedure to random COSMOS fields. If there are mul- 
tiple red-sequence over-densities along the line-of- sight, 
the one with the highest significance is selected. Red- 
sequence redshifts are then refined using spectroscopic 
information whenever possible. 

Of the 127 systems at z < 1 (after the quality cuts), 
81% have two or more spectroscopically confirmed mem- 
bers, 7% have one spectroscopically confirmed member, 
and 12% have a redshift that is determined solely by the 
red-sequence method. The average redshift error for the 
group ensemble at z < 1 is estimated at 0.006, somewhat 
larger than the typical velocity dispersion of our groups 
which is about 300 km s _1 . 

4.4. Center Determination 

Stacked weak lensing measurements require the identi- 
fication of the dark matter density peak. Centroid errors 
will lead to a smoothing of the lensing signal on small 
scales and to an underestimate o f halo mass (e.g. see dis- 
cussion in I Johnston et al. 112007 1. With the exception of 



on- going, nearly equal mass ratio mergers, the centroid 
of the X-ray emission should indicate where the potential 
well is deepest. Halo centers are also often assumed to 
contain central galaxies (CGs) which can be used as good 



tracers on condition that they can be correctly identified. 

The wavelet-re constructed X-ray imag e is analysed 
with SExtractor (Bertin & Arnouts 19961 to determine 
the centers and 2d shapes ot the extended X-ray emission. 
The accuracy of the determination of the X-ray center is 
higher for xflag=1 systems than for xflag=2 systems 
which are somewhat affected by projection effects. The 
maximum uncertainty of the peak for xflag=2 systems 
is determined by the size of the wavelet scale, which is 
32". Although the X-ray centroid is not precise enough 
to be used directly in most cases (of 127 system that re- 
main after the quality cuts described in previous sections, 
55 have xflag=1 and 72 have xflag=2), it is precise 
enough to be used as a strong prior on the location of 
the CG. Indeed, the maximum uncertainty in the X-ray 
centroid is 32" (193 kpc at z=0.5), however, most 
systems have a smaller positional uncertainty than this. 
For comparison purposes, the average projected radial 
offset for mis-identified CGs in MaxBCG is larger than 
600 /iy 2 kpc (Figure 5, Johnston et al. 20071 (note that 
their distances must be converted to our assumed value 
of .Ho). 

In many previous studies, Brightest Cluster Galaxies 
(BCGs) have been associated with the CGs of group 
and cluster halos. Given the ambiguity in the choice 
of the filter in which BCG galaxies should be taken as 
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"brightest" as well as the sensitivity of optical luminos- 
ity to recent star-formation, we prefer the utilization of 
the Most Massive Central Galaxy (MMCG) located near 
the peak X-ray emission (where massive refers to stel- 
lar mass). We assume that the MMCG can be used 
to trace the center of the dark matter halos of groups 
and clusters. An automatic algorithm was developed 
to identify MMCGs. Briefly, for each X-ray group, a 
broad group member selection is made by selecting galax- 
ies within 800kpc of the peak X-ray emission such that 
\zphot ~ Zg roup \ < 0.03 x (1 + z group ). Next, galaxies are 
rank-ordered by their stellar mass and weighted by the 
proximity to the peak X-ray emission - the MMCG is the 
galaxy with the highest rank. The results were visually 
inspected and divided into three categories: 

1. CG-TYPE=1: the CG is visually obvious (for the 
most part, a dominant early type galaxy with an 
extended envelope) and the algorithm has selected 
the correct galaxy. 

2. CG-type=2: there is a visual ambiguity in the CG 
selection but we estimate that the algorithm has 
selected a galaxy that has a 50% chance of being 
the CG. 

3. CG-type=3: the visual identification is highly am- 
biguous or there is some other problem that pre- 
vents the identification of the CG. 

In this study, we only consider CG-type=1 and CG- 
type=2 systems. In combination with the quality cuts 
described previously, these cuts leave a total of 118 
groups and clusters at z < 1 (95 of which are CG-type=1 
and 23 are CG-type=2). The details of the MMCG se- 
lection as well as tests regarding centering uncertainties 
will be presented in a forthcoming paper (George et al., 
in prep). 

In terms of stacked weak-lensing, there are two mis- 
centering effects to be taken into consideration. The first 
is that the location of CGs could be poor tracers of the 
actual centers of their dark matter halos. The second is 
that the CGs could be mis-identified. In a similar fash- 



ion to the maxBCG studies (Sheldon et al. 112007 John 



ston et al. 


2007 


analysis di 


tiers i 



2007), we neglect the former. However, our 



ri(. 



to the latter. 



|Johnston et al 
identification fraction of 



(20071 assume a CG mis- 
30% and apply a mis-centering 
kernel in their analysis to account for this effect. In this 
study, we assume that our X-ray prior combined with 
thorough visual checks, allows us to correctly identify 
the CG for a majority of our systems and that when a 
CG is mis-identified, the projected radial offset from the 
dark matter center is not large. Testing this assump- 
tion in further detail will be the focus of a subsequent 
paper (George et al., in prep). Nonetheless, in f|7] we 
have also demonstrated that restricting our analysis to 
CG-type=1 systems does not affect our results, indicat- 
ing that errors due to mi-centering are probably not a 
dominant effect for this work. 

We also note that our CG selection is based on stellar 
mass and is insensitive to color, hence we avoid the prob- 
lem of blue core BCGs which represent about 25% of the 



BCG population according to Bildfell et al. (2008). The 



offset from the red sequence of 0.5 to 1.0 mag in (g' - r') 
which could lead to their rejection by red-sequence type 
methods. 

5. FUNCTIONAL FORM OF M - L x 

In this section we present previously published results 
and overview the various assumptions that are made re- 
garding the functional form of M — Lx- We also explain 
how the scatter between mass and luminosity can cause 
subtle differences between the study of M as a function 
of Lx (the "M — Lx relation" ) and the study of Lx as a 
function of M (the "L x - M relation"). 
Previous measurements of the Lx — M relation based 



on X - ray data (|Reiprich fc Bohringer||2002| |Allen et al 
20031 IPopesso et al.||2005| |Chen et al.|2007| |Pratt et al 
2008| |zhang et al.||2008| |Vikhlinin et al.|2009p as well as' 
the the M — Lx relation derived with lensing dHoekstra 



2007a Bardeau 



et^[2007] |Rykoff et af][2008l ) are by- 
;ent with a power-law, but witn a slope 



and-large consistent with a power- 
and amplitude that differ from the self-similar prediction 

3 /4 

of M oc L x . In contrast, the evolution of the Lx — M 
relation is still under much debate with certain authors 
finding t hat the Lx — M relation evolves in a self-similar 
fashion (|Lumb et 



al ||2004[ lA rna ud et al. || 2005 | |Kotov 
& Vikhlinin 2005; Maughan 2007]) while others do not 
qtittori et ai[2004p . 

In addition to the shape and evolution of the mean 
Lx — M relation, astrophysical processes are expected to 
induce scatter in Lx at fixed mass which is important to 
take into consideration. In the absence of strong obser- 
vational or theoretical guidance fo r the form and magni- 
tude of this scatter ( although see Reiprich & Bohringer 



2002 Mau 



tic model 



ighan 2007), it is common to adopt a stochas 
where P(Lx\M) is a log-normal probability 



distribution function (hereafter PDF) with a mean log 
luminosity that follows (InLx) oc In M and with a con- 
stant log-normal s c atter noted ain Lv ( Stanek et al.|2006 



Rozo et al.| 



_ 2008; Ryko ff et aLl|200§[ | Vikhlinin et a 
2009| . In this particular case, and under the further 
condition that the halo mass function is a power-law, 
it can be demonstrated that P(M\Lx) is also a log- 
normal probability function with a dispersion in mass 
equal to <J\ n M — cinL x /P and a mean log-mass that 
follows (InM) ex alnLx with a = 1/(3 (see Appendix 
for further details). As a consequence, the slopes of the 
Lx — M and the M — Lx relation can be compared quite 
simply but the comparison of the normalization will de- 
pend on the halo mass function. 

The difference in the normalization of P(Lx\M) and 
P(M\Lx) can be seen as a form of extended Malmquist 
bias. This is not Malmquist bias is the classical sense 
because it will occur in any survey, independently of the 
flux limit. The equations for this bias are derived in the 
Appendix. In g eneral, X-ray astronomers commonly em- 
ploy P(Lx\M) ( |Stanek et al.|2006||Vikhlinin et al.|2009|) 
whereas lensing more naturally derives P(M\Lx)( Rozo 
et~all[2008| |Rykoff et al] |2008) so care must be taken 



excess blue light in these systems can lead to a typical 



when comparing the two. 

In reality, the slope of the mass function varies with 
both mass and redshift and as a consequence, a = 1//3 
no longer holds when slopes are derived over a large range 
in masses. Corrections for this effect are derived in the 
Appendix. 

Stacked weak lensing yields a measurement of the 
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arithmetic mean of the surface mass density contrast, 
(AS(r)). When the data are binned according to a 
well chosen proxy and over a narrow redshift range (to 
avoid smearing the profiles due to evolution in the mass- 
concentration relation for example), the mass derived by 
fitting (AS(r)) will be close to to the arithmetic mean 
of the stacked ensemble (M 200 ). For this reason, we se- 
lect narrow redshift bins that further enable us to as- 
sume that (M WQ .E(z)) ~ (M 200 )E((z)). Note that if 
the PDF of the mass at fixed luminosity is log-normal, 
then (M 2 oo) will be different from the peak of the PDF. 
Indeed, in this case, the peak is traced by the median, 
not the arithmetic mean. If the scatter of P(M\L) is 
known, then the correspondence between the two is given 
by (M 200 ) = exp((ln(M 2 oo)) + < M /2). Figure | illus- 
trates the various issues outlined above and for which 
more detailed calculations are presented in the Appendix. 

Estimates for the scatter 7 in the M — Lx relation vary 
from <7inM ~ 0.2 to <J\ n M ~ 0.3 and can depend exactly 
on how Lx is measured. The lowes t scatter is obtained 
with cool-core excision tec hnique s (|Stanek et al. 2006 
M aughan|[2007l |Pratt et al.||2008[ ). For distant clusters 
e cluster core region can become smaller than the ob- 
served PSF and so cool-core excision becomes infeasible. 
However, there have been s uggestions that the coo l-core 



M; 
th 



fraction is low at z > 0.5 (Vikhlinin et al. 2007), per- 
haps mak ing excision unne cessary at higher redshifts. In 
addition, |Maughan| ( 2007 1 have suggested that o\ n m is 



reasonably small even lor survey quality data. In any 
case, although the scatter in mass at fixed luminosity is 
still poorly constrained (especially at high redshift), by 
most estimates it is smaller than the scatter in mass at 
fixed richness, ev en for the best richness estimators (e.g. 
Rozo et al.||2008} find <r\ nM \\ = 0-45). 

Given the various considerations discussed above, we 
adopt a power-law form for the mean relation between 
mass and luminosity with a redshift evolution that fol- 
lows self-similarity: 



(Mgoo E{z)) 
M 



A 



(Lx E(z) 



L 



XA) 



(13) 



where M = IO^H^Mq and L x ,o = 10 42 7 fef 2 2 ergs" 1 . 
Devi ations from self-similar evolution are tested for in 

To begin with, we derive the relationship between 
the mean mass (M2oo)E((z)) and the mean luminosity 
(L x E(z)- 1 ) using only the COSMOS data. This rela- 
tion is referred to as 1ZIm-l x - Next, we combine the 
COSMOS results with previously published cluster data 
to improve measurements of the slope a. This combined 
relation is referred to as 1Z2m-l x ■ 

6. RESULTS 

In this section we present our lensing measurements as 
well as the M — Lx relations 1Z1m-l x and H2m-l x - We 
also test for additional redshift evolution in the M — Lx 
relation beyond that predicted by self-similarity. 

6.1. Stacked estimates of AS and M 200 



7 Scatter is quoted as the standard deviation of the natural log- 
arithm of the mass at fixed Lx • 



The data are divided into nine bins labeled Aq through 
As (see Figure [3] and Table [lj. The bins are selected to 
encompass a narrow range in redshift and LxE(z)~~ l so 
as to avoid smearing out the signal due to evolution in the 
mass-concentration relation. For each bin, the stacked 
weak lensing signal is calculated according to the method 
outlined in Sj3] 

We calculate the weak lensing signal from 40kpc to 
4Mpc in logarithmically spaced radial bins of 0.26 dex. 
A weak lensing signal is detected all the way to 4Mpc, 
allowing us to probe the full extent of the one-halo term. 
The results are fit with the parametric model given by 
Equation [l2j 

We use a Markov chain Monte Carlo (MCMC) method 
to fit the AS profiles. The MCMC routine uses the 
Metropolis-Hastings algorithm with a Gaussian trans- 
fer function. The total number of steps is 30000 and 
a burn-in period of 500 is discarded. Bins with less than 
15 background sources in total are excluded from the fit. 

The results from the stacked analysis and the fits to the 
profiles are shown in Figure HJ At the smallest radii that 
we probe, the stellar mass of the central galaxy plays a 
minor role in the lensing signal but we have added it for 
consistency. The scales that we probe are dominated by 
the signal due to the dark matter halo associated with 
the groups. Our estimates for (M 2 oo) are given in Table 

m 



6.2. Measurement of HA m-l x 
Fitting only the low redshift COSMOS data (from AO 
to A6, 0.2 < z < 0.5), we obtain the best fit parameters 
log 10 (A) = 0.068 ±0.063 and a = 0.66 ±0.14 (see Figure 
[5]). The cited errors are statistical only. The effects of 
systematic errors are explored in f|7] and are estimated 
to be lower than the statistical uncertainty. The cool- 
core correction factor that we apply does not strongly 
affect these results. Without the cool-core correction, we 
obtain loe, (A) = 0.09 ± 0.062 and a = 0.64 ± 0.14. 

Figure [6] compares the COSMOS 1Z1m-l x relation to 
four otherlensing-based m easurements: the SDSS results 
from iRykoff et al. l (|2008] ) (hereafter R08), four groups 
from |Berge et al. (2008 [(hereafter BE08), cluster data 
from jHockstra (200 7b| (hereaft er H07) with masses up- 
dated in |lVlahdavi et al | ( 2008 1 , and cluster data from 



Bardeau et al.| ( |2007p ~(herealter BA07). All data points 
have been normalized to Hq = 72 hr 2 km s _1 Mpc -1 and 
scaled by E(z). The cluster data points from BA07 and 
the H07 have been selected on the basis that their lens- 
ing analysis extends to the virial radius, allowing them to 
derive mass estimates that are comparable to ours. Each 
of these four independent studies probes a distinct red- 
shift and mass scale. Nevertheless, the ensemble of data 
points displays a remarkable trend that spans over three 
orders of magnitude in IjxE(z)^ 1 and two orders of mag- 
nitude in M 200 E(z). The varying degrees of scatter seen 
between the different data-sets is due to the fact that 
some results are direct detections of individual clusters 
(e.g. H07, BA07 and BE08) while other data points have 
been stacked (e.g. R08). The COSMOS data points are 
scattered about the mean relation because each bin con- 
tains a relatively small number of groups (tens of groups 
as opposed to hundreds in R08) . We will now briefly de- 
scribe each of these data sets. Further discussion of the 
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Fig. 2. — Illustration of the effects of biases in the Lx — M and the M — Lx relations at z = 0.2. The conditional probability distribution 
of the luminosity given the mass, P(]-x VI. z), is assumed to be log-normal with a mean log-luminosity that follows a power-law scaling 
relation (blue solid line) with a slope of 1.63 = 1/0.61 and with a scatte r of m-r, i = 0.4. A s imulated ensemble of groups and clusters 
of galaxies is created for which the number densities per mass follow the |Tinker et al.| l |2008^ halo mass function and for which X-ray 
luminosities are attributed according to P(Lx|M, z). The grey shading in the upper panel is proportional to the log of the number density 
of groups and clusters in the luminosity-mass plane (arbitrary normalization). The mean log- mass follows the cyan dashed line and the log 
of the mean-mass (log 1 Q(M20o), measured by lensing) is shown by the dash-dot magenta line. There is a mass-dependent bias between the 
cyan and the dark blue line equal to b m = <r^ n ( , y — 1) where 7 is the slope of the mass function (see Appendix). As a results of this bias, 
there is a slight curvature in both the cyan and the magenta lines at high halo masses. 



agreement between various results is presented in S|8] 

The R08 data points (light blue, plus signs) are taken 
from Table 1 of their paper. Lx and M200 have been 
normalized to our adopted value of Hq and scale d with 
the E(z) factor at the quoted redshift of z = 0.25. |Man- 



delbaum et al. (2008) have shown that the masses pub 



lished by Johnston et al.| ( |2007[) which have been used 
in the R08 analysis, must be boosted by a factor of 
1.18 1 ' 4 = 1.25 when the SDSS source distribution is cal- 
ibrated against zCOSMOS spectroscopic redshifts. We 
boost the R08 data points by a factor of 1.24 (a re- 
vised version of the published correction, R. Mandel- 
baum, priv. comm.) and the results are shown by the 
small black triangles in Figure [6] The upper error bars of 
the R08 data points are increased in order to reflect this 
correction. Note that because both M200 and Lx have 
been derived via stacking methods, the results of R08 
will depend on the covariance between Lx and richness 
(noted 7V 2 oo, see R08) at fixed mass: the slope of their re- 
lation will change depending on the correlation between 
these two parameters. This is not an issue for COSMOS 



where the stacking is performed directly on Lx instead 
of using an intermediate variable such as N 2 oo- 

The masses for the BE08 data points (dark green, as- 
terisk signs) are taken from their paper and Lx has been 
provided by F. Pacaud (priv. comm.). 

The masses for the BA07 data points (sienna, cross 
signs) are taken from their paper. X-ray luminosities are 
derived from the XMM LOCUSS sur vey using the same 
X-ray data as in Zhang et al. (2008) but by integrating 



the flux out to the truncation radius of 2.5r^ n '"' i . Note 
that rj^ wl is given in §6.2.3 in|Zhang et al.| (|2008|). The 



imaging data for the BA07 analysis is based on ground- 
based wide-field imaging obtained with the CFH12k cam- 
era on the Canada France Hawaii Telescope (CFHT). 
One cluster in the BA07 paper did not have an XMM 
LOCUSS luminosity and has been discarded (namely 
A2219). 

The H07 data points (dark blu e diamonds) have been 
taken from column 8 of Table 1 in Mahdavi et al.|p008[ ). 
Luminosities are derived from the LOCUSS survey using 
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TABLE 1 

Various properties for each of the nine bins 



Bin ID N LE ns {L X -E(z) 1 ) / C c a (Af200> (z) E((z}) L z fbias fboost fbias X fboost 

(10 42 h" 2 erg s- 1 ) (10 13 M ) (lO" 4 h^f pc 2 M~ 2 ) 



AO 


1 


31.14 ±0.49 


1.0 


14.9±Ii 


0.22 


1.10 


3.16 


1.01 


1.00 


1.01 


Al 


3 


13.75 ±0.50 


1.0 


o q+3.3 
°- 2 -2.3 


0.36 


1.18 


3.78 


1.02 


1.00 


1.03 


A2 


3 


6.04 ±0.28 


1.17 


9.9+H 


0.35 


1.17 


3.77 


1.01 


1.00 


1.02 


A3 


11 


2.21 ±0.15 


1.08 


o 9+1-3 
J - -0.9 


0.36 


1.18 


3.77 


1.02 


1.00 


1.02 


Ad 


7 


0.90 ±0.06 


1.03 


2 1+ 15 
z - i -0.8 


0.23 


1.11 


3.20 


1.01 


1.00 


1.01 


A5 


13 


1.24 ±0.09 


1.05 




0.35 


1.18 


3.77 


1.01 


1.00 


1.02 


A6 


11 


3.65 ±0.21 


1.17 




0.50 


1.27 


3.96 


1.03 


1.01 


1.04 


A7 


23 


4.72 ±0.24 


1.13 


2 6+ 11 


0.69 


1.41 


3.84 


1.04 


1.02 


1.06 


A8 


21 


10.51 ±0.50 


1.14 


7 fi +23 


0.90 


1.58 


3.65 


1.03 


1.05 


1.08 



a The cool-core correction factor that is applied to column 3 (sec description in §2.1). 
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Fig. 3. — COSMOS groups within the ACS field as a function of redshift and Lx-E(z) -1 , binned into nine sub-samples labeled Aq through 
As- Each bin is designed to encompass a narrow range in both redshift and Ly^E(z)~ 1 . The 4<r sensitivity limits are shown by the green 
lines: 96% of the ACS field is covered to a sensitivity limit of 1.0 10 — 15 erg cm -2 s — 1 where the sensitivity limits have been derived using a 
wavelet sensitivity map on group scales (green dotted line), and 52% of the ACS field is covered to a deeper limit of 6.7 10 — 16 erg cm -2 s _1 
(green dashed line). Small black circles represent systems that were rejected from the lens catalog. Blue filled circles indicate systems for 
which the central galaxy (CG) identification is certain whereas black triangles show systems for which the identification is more ambiguous. 



the methodology described in the previous paragraph. 
The overlap between the surveys is the following set of 
clusters: A68, A209, A267, A383, A963, A1689, A1763, 
A2218, and A2390. Note that BA07 and H07 have stud- 
ied a common set of clusters using the same imaging data 
(CFH12k camera). Differences is cluster mass estimates 
between BA07 and H07 are probably due to dissimilar 
data analysis techniques (such as assumptions regarding 
the mass-concentration relation for example). 

6.3. Measurement of 1Z2m-l x 
In this section, we perform a joint fit between the COS- 
MOS data and cluster data from H07 and BA07. The 
high redshift COSMOS data points (bins A7 and A s ) are 
excluded from this fit so that all data points are at a com- 
parable redshift (z ~ 0.3). The joint fit COSMOS/H07 



yields log 10 (A) = 0.03 ± 0.06 and a = 0.64 ± 0.03. The 
joint fit COSMOS/B07 also yields log 10 (A) = 0.03±0.06 
and a = 0.64 ±0.03. These results raise several points of 
interest. Firstly, the fit to the COSMOS data alone gives 
a very similar relation to the fit when the cluster data is 
added. This suggests that the M — Lx relation is invari- 
ant from group to cluster scales with no detected break 
at group scales. Secondly, the combination of group and 
cluster data creates a large dynamic mass range which 
allows for a 5% (statistical error) determination of the 
slope of the M — Lx relation. Finally, despite the sys- 
tematic differences in the mass estimates of H07 and 
BA07, the combined relations with COSMOS are identi- 
cal. The addition of group data has significantly reduced 
the impact of cluster mass uncertainties on the measured 
M — Lx relation. 
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Fig. 4. — Stacked weak lensing profiles of COSMOS groups for nine bins that each span a narrow range in redshift and LxE(z)~ l . From 
left to right and top to bottom we show the bins Aq through Ag. A weak lensing signal is detected all the way to 4Mpc, allowing us to 
probe the lens density profiles well beyond the virial radius. The solid blue curve shows our fit to the data which is the sum of a bar yoni c 
term (red dash-dot), an NFW profile (green dash-dash), and a second order weak shear correction term (orange dash-dot-dot, see § |3.2[ l. 
Grey points are negative data-points and bins with less than 15 source galaxies. 



6.4. Redshift Evolution in the M — Lx relation 

In this section we test for redshift evolution in the 
M — Lx relation by adding an additional redshift depen- 



dent term to our previously assumed M 
follows: 



Lx relation as 



(Af 200 E{z)) 
M a 

(M 200 E{z)) 
M 



= A 



= A 



[L x E(z)- 



E(zf 



(14) 



[L x E(z)- 



L 



x,o 



(1 + z) 6 (15) 



Fitting the COSMOS data alone with Equation [14 
yields log 10 (A) = 0.05 ± 0.1, a = 0.70 ± 0.13, and 7 = 
-0.07 ± 0.9. Equation p] yields log 10 (A) = 0.07 ± 0.14, 
a = 0.70 ± 0.13, and -0.14 ± 0.8. In both cases, 
the COSMOS data is consistent with self-similar redshift 
evolution, to the level that can be probed with these 
data. Additional cluster and group data with weak- 
lensing masses at 0.6 < z < 1.0 would be required in 



order to constrain the redshift evolution in the M — Lx 
relation to higher precision. 

7. ASSESSMENT OF SYSTEMATICS ERRORS 

There are three potential causes of systematic errors 
in this work: errors in the photometric redshifts, mis- 
centering, and uncertainty in the mass-concentration re- 
lation. As demonstrated below, we have found that the 
systematic errors associated with each of these effects are 
below the statistical error, at least to the extent that this 
can be tested for with the current data. 

7.1. The impact of photometric redshift errors 

The effects of redshift errors on group-galaxy lensing 
signals can be broadly categorized as follows: a) uncer- 
tainties in the redshifts of the lenses will smear out the 
signal and affect the derivation of £ cr ; t , b) errors in the 
mean source redshift distribution will introduce a bias 
in the normalization of the overall signal, c) improper 
lens-source separation will lead to a dilution of the sig- 
nal and will subject the signal to the effects of intrinsic 
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Fig. 5. — Left panel: marginal posterior distribution for the pa- 
rameters of 1ZIm—Lx (COSMOS data only). The blue shaded 
region denotes the 68% (lo") confidence region and the red shaded 
region denotes the 95% (2<r) confidence region. The dash-dot line 
indicates the self-similar prediction for the slope of the M — Lx re- 
lation. Right panel: marginal posterior distribution for t he combi- 
nation o f the COSMOS group data and cluster data from |Hoekstra| 
l |2007b| l . The large dynamic mass range created by the combination 
ot group and cluster data enables a more accurate determination 
of the slope of the M — Lx relation. 

alignments, d) and catastrophic errors can also dilute the 
signal. In this work, we neglect (a) on the basis that the 
rcdshifts of the groups are well determined (81% have 
two or more spectroscopically confirmed members). 

To reduce the effects of (c) and (d) , we use the full red- 
shift PDF which is deriv ed by the photoz c ode LePharc 



2009J). The mam 

represents the most prob- 



for each source galaxy (Ilbert et al. 
peak in the PDF, zp_max, 
able redshift. When present, a secondary peak in the 
PDF is noted ZP_SEC. The galaxy population with dou- 
ble peaked PDFs is expected to contain a large fraction of 
catas trophic errors (roughly 40%-50%, pjert et al.|2006j 
20091. In this work, we eliminate sources with a double 
peaked PDF in order to minimize the effects of signal di- 
lution caused by catastrophic errors. Making this photoz 
cut leads to a decreased background number density of 
34 galaxies/ arcmin 2 . 

The redshift PDF information is also used to improve 
the lens-source separation by using the lower 68% confi- 
dence bound on the source redshift. Sources are selected 
such that z$ — zl > 068% ( z s)- It is also important to 
note that group-galaxy lensing signals are most sensitive 
to redshift errors when z$ is only slightly larger then 
zl (see Figure 8). For this reason, in addition to the 
previous cut, we also implement a fixed cut such that 
zs — z~l > 6z, where Sz is defined below in the following 
two schemes: 

• 51: Only sources with a single peaked PDF are 
used, z s - z L > cr 68 %(zs), and z s - z L > 0.1 (the 
default scheme used throughout this paper). 

• 52: Only sources with a single peaked PDF are 
used, z s - z L > cr 68 %(zs), and z s - z L > 0.2. 

We test each of these two schemes and the results are 
shown in Figure W\ Choosing a value of 5z = 0.1 or 
Sz = 0.2 has a negligible impact on M 2 oo- 

In order to quantify the errors associated with (b) and 
(d), we make use of the ensemble of spectroscopic red- 
shifts that are currently available from the zCOSMOS 
program. As illustrated in Figure [HJ source galaxies 



with Zphot > ^lons and z spoc < z lQns dilute the signal 
whereas galaxies with z p hot > zi cns and z spcc > z\ cns but 
Zphot 7^ z spoc will introduce a bias in AS because E cr it 
will be mis-estimated when transforming 7 into AE. The 
correction factor for biases in the photometric redshifts is 
noted fbias while fboost represents a number greater than 
1 that boosts the measured signal to compensate for sig- 
nal dilution. The true signal is related to the measured 
one according to: 



AE r 



^ fbias ^ fboost- 



(16) 



Using the ensemble of zCOSMOS spectra that are cur- 
rently available, we have estimated fbias and fboost for 
each of our nine bins. The maximum bias that we find 
is a 7% upwards correction on AE for the last redshift 
bin which corresponds to a revised mass of 8.5 x 10 13 
Mq. This is a small correction compared to our mea- 
sured value of 7.6^ 2 /| i x 10 13 M . In conclusion, to 
the extent that we can estimate fbias and fboost using the 
current zCOSMOS data, we find that the errors due to 
imperfect photometric redshifts are below the statistical 
uncertainties. It is important to note, however, that the 
zCOSMOS spectroscopic sample may not be fully rep- 
resentative of the background source population because 
of incompleteness. Thus it is possible that our estimates 
of fbias and fboost are erroneous. For this reason, we do 
not apply these correction factors to the data and have 
simply listed the values of fbias and fboost in Table 1 as 
an indication of the probable systematic uncertainty due 
to photometric redshift errors. 

7.2. The effects of mis-centering 

To test for mis-centering effects, we recalculate the 
lensing signals using only systems with CG-type=1 in- 
stead of using both CG-type=1 and CG-type=2. The 
results are shown in Figure [7] and we find that restrict- 
ing our analysis to the sub-sample of groups that have 
visually obvious central galaxies has no impact on our 
estimates of M 2 oo- 

7.3. The mass concentration relation 

Testing for the effects of theoretical uncertainties in 
the M200 ^ C200 relation is beyond the scope of this 
paper, however, one test we can perform is whether or 
not different -M200 — C200 relations affect our mass es- 
timates. For this purpose, we comp are two recen t ly de- 
rived M200 — C200 rel ations, one from l M accio et al. ( 2007 1 
and the second from Zhao et aT| ( |2008[) . We compute our 
lensing signals with each of these relations and show that 
the results are largely unaffected by this test. Note that 
the agreement in the M200 — C200 relation from various 
authors is fairly good in the mass and redshift regime of 
our group sample (the typical fractional difference is 10% 
to 20%). However, at higher masses the disagreement is 
larger and hence the assumed concentrations in H07 and 
BA07 could represent a systematic error in the joint fit 
between COSMOS and the cluster data. 

7.4. Conclusions regarding systematic uncertainties 

As demonstrated in Figure [7] all the effects that we 
have tested for are largely negligible compared to the 
statistical error. However, one aspect that we have not 
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Fig. 6. — The COSMOS M — Lx relation. Dark blue diamonds show individually detected clusters from H07 with updated masses from 
Madhavi et al. 2008. Sienna cross symbols show data points from BA07. Light blue plus symbols represent the R08 results from a stacked 
analysis in the SDSS and black diamonds take into account a recent correction to these masses due to a new calibration of the source 
distribution. The upper error bars have been adjusted to account for the redshift uncertainty. Green asterisks show four data points at 
intermediate masses from BE08. Finally, the red squares depict our COSMOS results which extend previous results to lower masses and to 
higher redshifts. Three arrows highlight the highest redshift COSMOS data points (bins Aq, A7 and Ag). The grey shaded region shows the 
upper and lower envelope of the ensemble of lines with a slope and intercept that lie within the 68 percent confidence region of "R.1m-l x ■ 



explored in this work is the fact that each of our stacks 
contains a relatively small number of groups. Indeed, 
the assumption of spherical symmetry will begin to break 
down for stacks that only contain a small number of ob- 
jects and the weak lensing signal may be contaminated 
from projection effects that have not averaged away. We 
have tried to limit this effect by discarding all systems 
with visible projections along the line-of-sight . In total, 
the various quality cuts described in Sj3] are such that 
about 30% of the initial sample is rejected from the lens 
catalog. Most of these cuts were linked to the quality 
of the X-ray data and to projection effects. Additional 
XMM-Newton and Chandra data would increase the size 
of our lens sample and would help reduce the statistical 
error on the mass measurement. 

8. SUMMARY AND DISCUSSION 

In this work, we have used a sample of 206 X-ray de- 
tected galaxy groups to investigate the scaling relation 
between total mass and X-ray luminosity where M200 is 
derived via weak gravitational lensing. In the following 



paragraphs, we present a summary and discussion of our 
main results. 

The combination of group and cluster data. 

The COSMOS group catalog spans an approximate mass 
and redshift range M 2u o ~ 10 13 h^ 2 to Af 2u o ~ 
lO 14 /i7 2 1 M with 0.2 < z < 0.9, a new parameter space 
in terms of weak lensing-based mass measurements of X- 
ray detected groups and clusters of galaxies. When ap- 
propriately scaled for self-similar redshift evolution, the 
COSMOS data, alongside previously published results 
display a remarkable power-law relation that spans over 
three orders of magnitude in LxE(z)~ 1 and two orders 
of magnitude in M2ooE(z). The COSMOS data alone are 
well fit by a power law, M 20 o esc (Lx) a , with a slope of 
a = 0.66±0.14. By combining with previously published 
cluster data, we derive a tighter constraint on the slope, 
a = 0.64 ± 0.03. This is inconsistent at the 3.7a level 
with the self-similar prediction of a — 0.75. Note that 
the combination of group and cluster data greatly helps 
constrain the slope of the M — Lx relation but the deter- 
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Fig. 7. — Tests for systematic errors. We implement four tests that are designed to check for errors in the foreground-background 
separation scheme, to probe the effects of mis-centering, and to quantify the impacts of variations in the mass-concentration relation. The 
lensing signal is re-computed for each test and the results are compared to the masses used to calculate 1ZXl x —m ■ The data points have 
been offset on the X axis by a constant value for visualization purposes. The grey shaded region shows the upper and lower envelope of the 
ensemble of lines with a slope and intercept that lie within the 68 percent confidence region of 7?.l£ x — M' All the effects that are mentioned 
above are found to have a negligible impact on the results. 

mination of the normalization is at present limited by the 
accuracy of weak lensing measurements. Improvements 
in weak lensing methods and larger group and cluster 
samples will be necessary in order to improve constraints 
on the normalization of the M — Lx relation. 



Comparison with previous lensing results. Our 

analysis compares best with the local SDSS results from 
R08 who find L x oc [M 20 qY with f3 = 1.65 ± 0.13. The 
inverse of their slope, 1/(3 — 0.61 ± 0.048 is in excel- 
lent agreement with both 1Z1m-l x an d T^m-Lx- As 
described in their paper, because the R08 results are 
binned by richness, the slope of their relation is sensi- 
tive to the correlation coefficient between richness and 
Lx at fixed mass, r NL \ M (see §4 in R08). A value of 
r N.L\M — ±0.7 would change their slope to 0.68 ± 0.061 
and 0.54 ± 0.038 respectively. These slopes would still 
be in relatively good agreement with the COSMOS re- 
sults but much larger values than rjv,L|A/ = ±0.7 can 
be ruled out. Small values for t^l^i are a ls° favored 
by the analysis of |Rozo et al. (2008 1 who find a value of 
r N,L\M ~ 0-05 (Rozo pnv. comm.). 

B A07 have also published a M — Lx relation but find a 
slope of 1.20±0.16 that is inconsistent with the COSMOS 
value at the 3.4<r level. We suspect that this disagree- 
ment stems from an underestimate of the BA07 masses, 
in particular at the low-mass end of the cluster sample. 
Indeed, H07 have analyzed almost exactly the same set 
of clusters but find a slope and normalization that is 
in better agreement with both this work and R08 (see 
Figure [6] and also Figure [9]). Nevertheless, despite sys- 
tematic differences between the mass estimates of H07 
and BA07, the combined relations using either data set 
with COSMOS are almost identical. Manifestly, the ad- 
dition of group data has significantly reduced the impact 
of cluster mass uncertainties on the M — Lx relation. 

Comparison with previous X-ray results. The 

comparison of the normalization of the M — Lx relation 
derived with lensing on the one hand and with X-rays on 



the other hand is of great interest because it has the po- 
tential to reveal systematic biases in X-ray-based cluster 
mass estimates (e.g. due to non-thermal processes such 
as turbulence and cosmic-rays). However, the compar- 
ison of the normalization between the M — Lx relation 
and the Lx — M relation is complex because it depends 
on (Ti n m an d the slope of the halo mass function (see Ap- 
pendix) and we leave this aspect for future work. There- 
fore, we mainly focus on a comparison of the slopes. For 
this purpose, we have compiled a list of the slopes of the 
Lx — M relation as determined by various X-ray studies. 
When authors have stated their results in terms of the 
Lx — M relation, we have inverted the slope in order to 
compare with our results 8 . In the Appendix we show 
that although the slope of P(Lx\M) is not exactly equal 
to the inverse of the slope of P(M\Lx), the difference is 
small (also see Figure [2). 

A representative (but not exhaustive) list of X-ray- 
based results is the following: 



Reiprich & Bohringer (20021: L x oc M^ 49 ^ 089 



'200 



or L x oc M 2 i °^ ±U U8t> depending on the fitting 
method (extended sample, Table 7 of their paper). 
The inverse slopes are 0.67 ± 0.04 and 0.60 ± 0.03, 
respectively. 



Allen et al. 



(20031: M 200 oc L 



0.76! 



Popesso et aT] ( |2005| : L x oc M^f ±a23 . The in- 
verse slope is 0.63 ± 0.09. 



Chen et al. ( |2007 1 : L x oc M^ Q f ±0 - 13 . The inverse 
slope is 0.55 ± 0.04. 



Pratt et al 
^i.yb±u.ii 



(2008): L x oc Mb 81±0S0 or L x oc 



depending on the fitting method (Table 



Errors on y = — have been computed as Ay = ( — ) X Ax 
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Fig. 8. — The effects of photometric redshifts errors on group-galaxy lensing signals. Panels a) and b) illustrate the quality of the 
photometric redshifts used in this paper by comparing them to a combined sample of 12367 spectroscopic redshifts from the zCOSMOS 
"bright" and "faint" programs. There are three ways in which photometric redshift errors can impact group-galaxy lensing signals. Firstly, 
any type of photometric error such that Zp^ot < zi cns will have no effect on the signal because such objects are not included in the 
background selection (bottom hashed region). Secondly, photometric errors such that z p hot > z lcns an d z spcc < zi cns will lead to a signal 
dilution (left hashed region). Finally, photometric errors such that Zp^ot > zi cns and z spe c > zi ena but z p hot 7^ z spec will lead to a bias in 
AS because S cr i t will be mis-estimated when transforming 7 into AS. The dotted and dashed lines in panels a) and b) indicate where 
photoz errors lead to a 10% and a 20% error on AS. As can be seen, group-galaxy lensing signals are increasingly insensitive to photometric 
redshift errors at higher source redshifts. This can be understood by looking at the variation of S cr i t as a function of source redshift as 
shown in panel c). Indeed, S cr i t varies strongly near zi ena but flattens considerably at z SO urcc > 2zi cns . The arrows in panels a) and b) show 
where the mean COSMOS source redshift lies for lenses at zi ens =0.2 and zi ens =0.9. The bias in AS for group-galaxy lensing in COSMOS 
is estimated to be less than 7% for lenses below z = 1. 



2 in their paper). The inverse slopes are 0.55 ±0.03 
and 0.51 ± 0.029. 



• |Vikhlinin et al. ( 2009 ): L x 
inverse slope is 0.62 ± 0.05. 



M. 



1.61±0.14 
200 



The 



Figure [9] shows a comparison between X-ray based es- 
timates 01 the slope of the M — Lx relation and lensing- 
based results. As can be seen in this Figure, most of the 
lensing and the X-ray results are in excellent agreement 
with an average slope of a ~ 0.64. 

Evolution of scaling relations. Large surveys that 
will probe clusters up to z = 1 will need a precise un- 
derstanding of the redshift evolution in mass-observable 
relations. In this paper, we have shown that weak gravi- 
tational lensing is capable of meeting this challenge and 
we have tested several evolution scenarios from z ~ 0.2 
to z ~ 0.9. Our results are consistent with the evolution 
rate predicted by self-similarity but our errors are still 



relatively large due to our small sample size at high red- 
shift. Additional X-ray data would help improve the pre- 
cision of this measurement. More precise measurements 
of the redshift evolution of X-ray scaling laws would also 
help constrain the physical processes that govern the 
heating and cooling of the ICM. 

Self-calibration methods. Sufficiently large cluster 
surveys may be able to deal with the evolution and scat- 
ter in mass-observable relations by internally calibrat- 
ing for th ese uncertainties (the so called "self-calibration 
method'MLevine et al.|2^02l|Hu|2003l|Maiumdar fc Mohr| 
20041 IWang et al.|2004b Lima & Hu 2005). This method 



treats all uncertainties as tree parameters to be fitted 
along with the desired cosmological parameters. How- 
ever, there are several drawbacks to this method. The 
first is that treating systematic errors in this manner 
weakens the final statistical constraints. The second is 
that self-calibration requires a parametric form for the 
scatter and evolution of scaling relations. Bias is intro- 
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Fig. 9. — A comparison of the slope of the M — Lx relation obtained by various authors. From left to right we show TZXm—L x and 
TLIm—Lm (this work) followe d by the lensing based results of Rykoff et al.| (|2008^ and|Bardeau et al.|||2007|l and then by X- ray-based results 
| |Reiprich fc Bohringer|2002| [AUen et al.|2003| |Popesso et al.||2UU5| |Chen eHu^|20U7l |Fratt et al.|2008| | Vikhlinin et al.||2009[ |. The solid 
blue shows the seli-similar prediction tor the slope which is a = U.Yo. t he grey shaded region indicates the one sigma errors lor 1Z2m—l x ■ 
With the exception of the BA07 results, the lensing and the X-ray results are in good agreement with an average slope of a ~ 0.64. Note 
that because of scatter in the Lx — M relation and a halo mass function with a varying slope, the lensing and the X-ray results are not 
directly comparable. The dashed black line indicates the value of the COSMOS+H08 data point corrected for the difference between 1//3 
and Qi ens i n g assuming a scatter of cri nAf = 0.25 (see derivations in Appendix). 



duced if this parameterization is incorrect. 

This paper has showed that weak gravitational lensing 
can help constrain the actual form of the mass-observable 
relations as well as their evolution, even out to high red- 
shifts (z < 1). Having a (direct observational) exter- 
nal constraint will help boost the accuracy achievable 
with self-calibration methods by reducing the number 
of parameters and by pinning down the correct para- 
metric form. One important ingredient in the M — Lx 
relation, and for self-calibration exercises, is the scat- 
ter <J\ n M- Stacked weak lensing measurements are not 
suitable for measuring the scatter and so individual lens- 
ing detections will be necessary for this task. Conse- 
quently, estimates of the scatter with lensing will be lim- 
ited to high masses and moderate redshifts (see Figure 
[T]). Space-based data will be optimal for scatter studies 
Lecause high source densities will increase the mass and 
rcdshift range for which clusters can be directly probed. 
The challenge for lensing-based estimates of the scatter 
will be to ensure that the lensing errors are smaller than 
the intrinsic scatter, a condition that is probably not 
achieved at present. 

Nevertheless, one interesting point to note is that scat- 
ter will introduce a small amount of curvature in stacked 
lensing based measurements of the M — Lx relation (see 
Figure |2j. Therefore, it is possible that the scatter 
could be measured with future data by constraining the 
amount of curvature in M — Lx at high halo masses. 

In conclusion, the field of weak gravitational has 
started to become a truly competitive tool for calibrating 



the relation between the total mass of groups and clusters 
of galaxies and their baryonic tracers, over a wide range 

of masses (M 200 ~ 10 13 - 5 -10 15 - 5 /if 2 lM ©) and up to z = 1. 
At present, the slope of the M— Lx relation is constrained 
by lensing with a statistical significance that is compa- 
rable to X-ray studies (at the ~ 5% level). Although 
further work will be necessary in order to compare both 
the slope and the normalization, it is encouraging to note 
that both lensing and X-rays studies are already in good 
agreement with respect to the slope of the M — Lx re- 
lation (a ~ 0.64). The observational foundation for the 
calibration of the mass-observable relations that are es- 
sential for cosmological studies with clusters of galaxies 
is clearly growing firmer. 
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APPENDIX 

In this Appendix, we derive the relation between P(M|Lx, z) and P(Lx\M, z) and we show that a correction needs 
to be applied when comparing the slopes of the mean log luminosity and the mean log mass over a wide range in halo 
mass. The slope of the mean relation associated with P(M\Lx, z) is noted aie nsina and the slope of P{Lx \M , z) is 
noted (3. Partial aspects of this derivation can also be found in the Appendix of Mandelbaum & Seljak ( |2007 1. 



Let m = InM and I = InLx where M represents halo mass. Following Stanek et al. "j 2UUb| , we assume that the 



conditional probability distribution of the luminosity given the mass is log-normal. In this case, P(l\m, z) is Gaussian 
and we can write that: 

1 (—[I - h{m,z)} 



atV2^ V 2<t, 



P(l\m, z) = — = x exp -i " v 2 ' /J , (1) 



where lo(m, z) is the mean log luminosity and 07 is the scatter (also noted o~i n L x )- We assume that the scatter varies 
neither with mass, nor redshift (07 is constant) and that the mean log luminosity follows a power-law scaling relation 
with mass and with self-similar redshift evolution: 

(In L x ) = lo(m, z) = (3m + (1 + (3) In E(z) + B. (2) 

Let n(M) represent the number of dark matter halos per unit volume with mass less than M. Locally, the differential 
mass function is a power-law of the form dn/dM cx M~ 7 = e -7 ™. The probability of observing a halo of log-mass m 
is P(m) = dn/dlnM cx e -(7-i)™. 

The weak lensing signal of halos stacked according to Lx depends on P(M|Lx,£), the conditional probability 
distribution of the mass given the luminosity. Using Bayes theorem and ignoring those terms that only contribute to 
the overall normalization of P(m\l, z), we can write that 

P(m\l, z) cx P(l\m, z)P(tn, z) cx exp ^ — E - — (7 — l)m^J . (3) 

By using Equation [2] to develop the expression within the exponential and by completing the square, we obtain that 

P(m M oc exp ( - [m j7± Z)? )> (4) 



2(a i /3- 1 ) 2 

where (InM) = mo(l, z) = (1/(3)1 — (1 + 1/(3) InE(z) — B / f3 — 0^(7 — 1). In other terms, P(m\l, z) is Gaussian and 
P{M\Lx, z) is log-normal with a dispersion in mass equal to a m = 07 //3 and with a mean log mass that follows a 
power-law of with a slope of Oii ens i n g = 1//3. The mean log-mass, mo(l, z), can be obtained by solving for the mass in 
Equation [2] with the addition of an extra term. In this sense, there is a bias between mo(l, z) and the true mean mass 
which is equal to b m — of n (7 — 1) and this bias scales linearly with 7 — 1 where 7 is the slope of the mass function. 
Because, (7 — 1) is always positive, this bias causes mo(Z, z) to be biased low relative to the true mean mass. 

Both mass and luminosity are often expressed in units of logarithm to the base 10 rather than in units of natural 
logarithm. In this case, the scatter in mass at fixed luminosity is equal to 0~\ o g m = o~ m w = fm/ ln(10) and the bias is 
equal to 6 m io = <Jmw(l ~ 1) m (10) = 0^(7 — 1) / ln(10). Note that there is an extra factor of ln(10) in the expression 
of the bias due to the fact that P(log 10 M) = dn/dlog 10 M cx Af-^ 1 ' = e -(-v-i)in(io)iog 10 (M)_ 

We have demonstrated that ai ens i ng =1/(3 when the halo mass function is locally a power-law. In reality, the slope 
of the mass function varies with both mass and redshift. At z = 0.2, the slope of the mass function varies from 7^2 
at M 2 oo ~ 10 13 (1^2 Mq to 7 ~ 5 at M 2 qq ~ 10 15 h^M®- The fact that the bias depends on 7 will result in a change 
in slope and otiensing — 1/(3 will no longer be valid. Instead, there will be a correction factor between the slope derive 
by stacked weak lensing, ai ens i ng , and 1/(3 and we will now show how this correction factor can be roughly estimated. 

Let mi and mi represent two distinct halo masses in units of logarithm to the base 10. The local slope of the 
mass function at each of these masses is noted 71 and 72 and the biases are noted b m \ — (7^71/ ln(10) and b m 2 — 
°mio72/ ln(10). The slope that is derived via stacked weak lensing over the mass range [?7ti, 77^2] is equal to otiensing — 
F/(3 where F = 1 - (b ml - b m2 )/{mi - m 2 ). For z = 0.2, u m = 0.25 (<r ml0 - 0.109), Mi - 10 13 h^M® and 
M.2 ~ 10 15 H^Mq, we have b m \ ~ 0.03 and b„a ~ 0.11 (when m is expressed in logio units). The difference between 
1/(3 and Qiensing over this mass range is therefore roughly aiensing ~ 0.96//3. 
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